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Trace-driven nnemory simulation: a survey 
Richard A. Uhlig, Trevor N. Mudge 

June 1997 ACM Computing Surveys (CSUR), volume 29 issue 2 
Publisher: ACM Press 

Full text available- f9 pdf(636 1 1 KB) Additional Information: full citation , abstract , references , citings , index 
l^^;J_fc!__v : terms , review 

As the gap between processor and memory speeds continues to widen, methods for 
evaluating memory system designs before they are implemented in hardware are 
becoming increasingly important. One such method, trace-driven memory simulation, has 
been the subject of intense interest among researchers and has, as a result, enjoyed rapid 
development and substantial improvements during the past decade. This article surveys 
and analyzes these developments by establishing criteria for evaluating trac ... 



Keywords: TLBs, caches, memory management, memory simulation, trace-driven 
simulation 



2 Shade: a fast instruction-set simulator for execution profiling 
Bob Cmelik, David Keppel 
^ May 1994 ACM SIGMETRICS Performance Evaluation Review , Proceedings of the 
1994 ACM SIGMETRICS conference on Measurement and modeling of 
computer systems SIGMETRICS '94, Volume 22 issue i 
Publisher: ACM Press 

Full text available- f apdf(1.28 MB) Additional Information: full citation , abstract , references , citings , index 

terms 

Tracing tools are used widely to help analyze, design, and tune both hardware and 
software systems. This paper describes a tool called Shade which combines efficient 
instruction-set simulation with a flexible, extensible trace generation capability. Efficiency 
is achieved by dynamically compiling and caching code to simulate and trace the 
application program. The user may control the extent of tracing in a variety of ways; 
arbitrarily detailed application state information may be collected ... 

^ Disco: running commodity operating systems on scalable multiprocessors 
^ Edouard Bugnion, Scott Devine, Kinshuk Govil, Mendel Rosenblum 
>^ November 1997 ACM Transactions on Computer Systems (TOCS), Volume is issue 4 
Publisher: ACM Press 
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Full text available: Q pdf(40Q.76 KB) Additional Information: full citation, abstract , references, dtlnqs . index 

terms , review 

In this article we examine the problem of extending modern operating systems to run 
efficiently on large-scale shared-memory multiprocessors without a large implementation 
effort. Our approach brings back an idea popular in the 1970s: virtual machine monitors. 
We use virtual machines to run multiple commodity operating systems on a scalable 
multiprocessor. This solution addresses many of the challenges facing the system software 
for these machines. We demonstrate our approach with a prototy ... 

Keywords: scalable multiprocessors, virtual machines 



^ Implementation aspects of a SPARC V9 complete machine simulator 
Bill Clarke, Adam Czezowski, Peter Strazdins 

January 2002 Australian Computer Science Communications , Proceedings of the 

twenty-fifth Australasian conference on Computer science - Volume 4 
CRPITS '02, Volume 24 Issue 1 
Publisher: Australian Computer Society. Inc. , IEEE Computer Society Press 
Full text available: ^pdf(1.33MB) Additional Information: full citation , abstract , references , index terms 

In this paper we present work in progress in the development of a complete machine 
simulator for the UltraSPARC, an implementation of the SPARC V9 architecture. The 
complexity of the UltraSPARC ISA presents many challenges in developing a reliable and 
yet reasonably efficient implementation of such a simulator. Our implementation includes a 
heavily object-oriented design for the simulator modules and infrastructure, caching of 
repeated computations for performance, adding an OS (system call) emu ... 

Keywords: SNIP, SPARC V9 ISA, UltraSPARC, complete machine simulator, execution- 
driven simulation, object-oriented design 



Disco: running commodity operating systems on scalable multiprocessors 
Edouard Bugnion, Scott Devlne, Mendel Rosenblum 

October 1997 ACM SIGOPS Operating Systems Review , Proceedings of the sixteenth 
ACM symposium on Operating systems principles SOSP "97, volume 3i issue 

5 

Publisher: ACM Press 

Full text available: ^pdf(2.30MB) Additional Information: full citation , references , citings , index terms 



Devirtualizable virtual machines enabling general, single-node, online maintenance jj^ 
David E. Lowell, Yasushi Saito, Eileen J. Sannberg 

October 2004 ACM SIGARCH Computer Architecture News , ACM SIGOPS Operating 
Systems Review , ACM SIGPLAN Notices , Proceedings of the 11th 
international conference on Architectural support for programming 
languages and operating systems ASPLOS-XI, volume 32 , 38 , 39 issue 5 , 5 , ii 

Publisher: ACM Press 

Full text available: fi^pdfd 74.01 KB) Additional Information: full citation , abstract , references , citings , index 

terms 

Maintenance is the dominant source of downtime at high availability sites. Unfortunately, 
the dominant mechanism for reducing this downtime, cluster rolling upgrade, has two 
shortcomings that have prevented its broad acceptance. First, cluster-style maintenance 
over many nodes is typically performed a few nodes at a time, mak-ing maintenance slow 
and often impractical. Second, cluster-style maintenance does not work on single-node 
systems, despite the fact that their unavailability during mainte ... 
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Instruction fetching: coping with code bloat 

Richard Uhlig, David Nagle, Trevor Mudge, Stuart Sechrest, Joel Emer 

May 1995 ACM SIGARCH Computer Architecture News , Proceedings of the 22nd 

annual international symposium on Computer architecture ISCA '95, volume 

23 Issue 2 
Publisher: ACM Press 

Full text available- fi^ pdf(l.47 MB) Additional Information: full citation , abstract , references , citings , index 

Previous research has shown that the SPEC benchmarks achieve low miss ratios in 
relatively small instruction caches. This paper presents evidence that current software- 
development practices produce applications that exhibit substantially higher instruction- 
cache miss ratios than do the SPEC benchmarks. To represent these trends, we have 
assembled a collection of applications, called the Instruction Benchmark Suite (IBS), that 
provides a better test of instruction-cache performance. We discuss th ... 

Compilation and run-time systems: DELI: a new run-time control point 

Giuseppe Desoli, Nikolay Mateev, Evelyn Duesterwald, Paolo Faraboschi, Joseph A. Fisher 

November 2002 Proceedings of the 35th annual ACM/IEEE international symposium 

on Microarchitecture 
Publisher: IEEE Computer Society Press 

Full text available: ^ ^^^^^ ^ Additional Information: full citation , abstract , references , citings , index 

Publisher Site i^HTis 

The Dynamic Execution Layer Interface (DELI) offers the following unique capability: it 
provides fine-grain control over the execution of programs, by allowing its clients to 
observe and optionally manipulate every single instruction— at run time— just before it 
runs. DELI accomplishes this by opening up an interface to the layer between the 
execution of software and hardware. To avoid the slowdown, DELI caches a private copy of 
the executed code and always runs out of its own private cache. In ... 

Trap-driven memory simulation with Tapeworm II 
Richard Uhlig, David Nagle, Trevor Mudge, Stuart Sechrest 

January 1997 ACM Transactions on Modeling and Computer Simulation (TOMACS), 

Volume 7 Issue 1 
Publisher: ACM Press 

Full text available: ^ pdf(630.91 KB) Additional Information: full citation , references , index terms 



Keywords: Cache, TLB, memory system, simulation, trace-driven simulation, trap-driven 
simulation 



Virtual machine monitors: Xen and the art of virtualization 
Paul Barham, Boris Dragovic, Keir Fraser, Steven Hand, Tim Harris, Alex Ho, Rolf 
Neugebauer, Ian Pratt, Andrew Warfield 

October 2003 Proceedings of the nineteenth ACM symposium on Operating systems 

principles 
Publisher: ACM Press 

Full text available- l5lpdf(168 76 KB) Additional Information: full citation , abstract , references , citings , index 
^ terms 

Numerous systems have been designed which use virtualization to subdivide the ample 
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resources of a modern computer. Some require specialized hardware, or cannot support 
commodity operating systems. Some target 100% binary compatibility at the expense of 
performance. Others sacrifice security or functionality for speed. Few offer resource 
isolation or performance guarantees; most provide only best-effort provisioning, risking 
denial of service.This paper presents Xen, an x86 virtual machine monit ... 

Keywords: hypervisors, paravirtualization, virtual machine monitors 



Virtual machines: Scale and performance in the Denali isolation kernel 
Andrew Whitaker, Marianne Shaw, Steven D. Gribble 

December 2002 ACM SIGOPS Operating Systems Review, Volume 36 Issue SI 
Publisher: ACM Press 

Full text available: ' gpdf(1.91 MB) Additional Information: full citation , abstract , references , citings 

This paper describes the Denali isolation kernel, an operating system architecture that 
safely multiplexes a large number of untrusted Internet services on shared hardware. 
Denali's goal is to allow new Internet services to be "pushed" into third party 
infrastructure, relieving Internet service authors from the burden of acquiring and 
maintaining physical infrastructure. Our isolation kernel exposes a virtual machine 
abstraction, but unlike conventional virtual machine monitors, Denali does not ... 

12 Exokernel: an operating system architecture for application-level resource 
managennent 

D. R. Engler, M. F. Kaashoek, J. O'Toole 

December 1995 ACM SIGOPS Operating Systems Review , Proceedings of tlie fifteenth 

ACM symposium on Operating systems principles SOSP '95, Volume 29 
Issue 5 

Publisher: ACM Press 

Full text available: ^pdf(2.16MB) Additional Information: full citation , references , citings , index terms 




Trap-driven simulation with Tapeworm H 
Richard Uhlig, David Nagle, Trevor Mudge, Stuart Sechrest 

November 1994 ACM SIGPLAN Notices , ACi^ SIGOPS Operating Systems Review , 
Proceedings of the sixth international conference on Architectural 
support for programming languages and operating systems ASPLOS- 

VI, Volume 29 , 28 Issue 11,5 
Publisher: ACM Press 

Full text available* fi!|pdf(1.45 MB) Additional Information: full citation , abstract , references , citings , index 
^ terms 

Tapewornn II Is a software-based sinnulation tool that evaluates the cache and TLB 
performance of multiple-task and operating system intensive workloads. Tapeworm resides 
in an OS kernel and causes a host machine's hardware to drive simulations with kernel 
traps instead of with address traces, as is conventionally done. This allows Tapeworm to 
quickly and accurately capture complete memory referencing behavior with a limited 
degradation in overall system performance. This paper compares trap- ... 

Keywords: TLB, cache, memory system, trace-driven simulation, trap-driven simulation 



''^ Binary translation and architecture convergence issues for IBM system/390 
^ Michael Gschwind, Kemal Ebcioglu, Erik Altman, Sumedh Sathaye 
>^ May 2000 Proceedings of the 14th international conference on Supercomputing 
Publisher: ACM Press 
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Full text available: ^pdf(1.44 MB) Additional Information: full citation , abstract , references , index tenns 

We describe the design issues in an implementation of the ESA/390 architecture based on 
binary translation to a very long instruction word (VUW) processor. During binary 
translation, complex ESA/390 instructions are decomposed into instruction "primitives" 
which are then scheduled onto a wide-issue machine. The aim is to achieve high 
instruction level parallelism due to the increased scheduling and optimization opportunities 
which can be exploited by binary translation software ... 



Using the SimOS machine simulator to study complex computer systems 
Mendel Rosenblum, Edouard Bugnion, Scott Devine, Stephen A. Herrod 
January 1997 ACM Transactions on Modeling and Computer Simulation (TOM ACS), 

Volume 7 Issue 1 
Publisher: ACM Press 

Full text available: ■ g] Pdf(731.76 KB) Additional Information: full citation , references , dtings. index terms. 
^^^^-^^ review 



Keywords: computer architecture, computer simulation, computer system performance 
analysis, operating systems 



16 Workshop on architectural support for security and anti-virus (WASSA): Towards the Q 
issues in architectural support for protection of software execution 
Weidong Shi, Hsien-Hsin S. Lee, Chenghuai Lu, Mrinmoy Ghosh 
March 2005 ACM SIGARCH Computer Architecture News, Volume 33 issue i 
Publisher: ACM Press 

Full text available: ^ pdf(436.30 KB) Additional Information: full citation , abstract , references , index terms 

Recently, there is a growing interest in the research community to employ tamper- 
resistant processors for software protection. Many of these proposed systems rely on a 
specially tailored secure processor to prevent 1) illegal software duplication, 2) 
unauthorized software modification, and 3) unauthorized software reverse engineering, 
i^ost of these works primarily focus on the feasibility demonstration and design details 
rather than trying to elucidate many fundamental issues that are either "el ... 

Keywords: attack, copy protection, encryption, security, tamper. resistance 




''^ Improving the reliability of commodity operating systems 
^ Michael M. Swift, Brian N. Bershad, Henry M. Levy 

^ February 2005 ACM Transactions on Computer Systems (TOCS), Volume 23 issue i 
Publisher: ACM Press 

Full text available: ^ pdf(459.98 KB) Additional Information: full citation , abstract , references , index terms 

Despite decades of research in extensible operating system technology, extensions such as 
device drivers rennain a significant cause of systenn failures. In Windows XP, for exanriple, 
drivers account for 85&percent; of recently reported failures.This article describes Nooks, a 
reliability subsystem that seeks to greatly enhance operating system (OS) reliability by 
isolating the OS from driver failures. The Nooks approach is practical: rather than 
guaranteeing complete fault tolerance through ... 

Keywords: I/O, Recovery, device drivers, protection, virtual memory 
18 I 

Embra: fast and flexible machine simulation I 
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Emmett Witchel, Mendel Rosenblum 

May 1996 ACM SIGMETRICS Performance Evaluation Review , Proceedings of the 
1996 ACM SIGMETRICS international conference on Measurement and 
modeling of computer systems SIGMETRICS '96, Volume 24 issue i 
Publisher: ACM Press 

Full text available- 1 ^pdf(1.83 MB) Additional Information: full citation, abstract , references , citings, index 

This paper describes Embra, a simulator for the processors, caches, and mennory systenns 
of uniprocessors and cache-coherent multiprocessors. When running as part of the SimOS 
simulation environment, Embra models the processors of a MIPS R3000/R4000 machine 
faithfully enough to run a commercial operating system and arbitrary user applications. To 
achieve high simulation speed, Embra uses dynamic binary translation to generate code 
sequences which simulate the workload. It is the first machine simu ... 

19 Efficient instruction cache simulation and execution profiling with a threaded-code 
^ interpreter 
^ Peter S. Magnusson 

Decennber 1997 Proceedings of the 29th conference on Winter simulation 

Publisher: ACM Press 

Full text available: ^pdf{912.22 KB) Additional Information: full citation , references , index terms 



20 Virtual machine monitors: Implementing an untrusted operating system on trusted 
hardware 

David Lie, Chandramohan A. Thekkath, Mark Horowitz 

October 2003 Proceedings of the nineteenth ACM symposium on Operating systems 

principles 
Publisher: ACM Press 

Full text available* 153 pdf(280 87 KB) Additional Information: full citation , abstract , references , citings . Index 
* "^^^^ '' terms 

Recently, there has been considerable interest in providing "trusted computing platforms" 
using hardware'^— ~TCPA and Palladium being the most publicly visible examples. In this 
paper we discuss our experience with building such a platform using a traditional time- 
sharing operating system executing on XOM a processor architecture that provides 

copy protection and tamper-resistance functions. In XOM, only the processor is trusted; 
main memory and the operating system are not trusted. Our opera ... 

Keywords: XOM, XOMOS, untrusted operating systems 
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USPAT; 
EPO; JPO; 
DERWENT; 
IBMJTDB 


OR 


OFF 


2005/05/10 12:23 


S49 


33 


(kelly near edmund).in. 


US-PGPUB; 
USPAT; 
EPO; JPO; 
DERWENT; 
IBM_TDB 


OR 


OFF 


2005/05/10 12:23 
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S50 


20 


(cmelik near robert).in. 


US-PGPUB; 
USPAT; 
EPO; JPO; 
DERWEI^; 
IBM_TDB 


OR 


OFF 


2005/05/10 12:23 


S51 


17 


(wing near malcolm).in. 


US-PGPUB; 
USPAT; 
EPO; JPO; 
DERWENT; 
IBM TDB 


OR 


OFF 


2005/05/10 12:23 


S52 


43 


S49 or S50 or S51 


US-PGPUB; 
USPAT; 
EPO; JPO; 
DERWENT; 
IBl^JTDB 


OR 


OFF 


2005/05/10 12:23 


S53 


529265 


software 


US-PGPUB; 
USPAT; 
EPO; JPO; 
DERWENT; 
IBM TDB 


OR 


OFF 


2005/05/10 12:24 


S54 


405367 


hardware 


US-PGPUB; 
USPAT; 
EPO; JPO; 
DERWENT; 
IBM_TDB 


OR 


OFF 


2005/05/10 12:24 


S55 


273 


DLAT 


US-PGPUB; 
USPAT; 
EPO; JPO; 
DERWENT; 
IBM TDB 


OR 


OFF 


2005/05/10 12:24 


S56 


4526 


TLB 


US-PGPUB; 
USPAT; 
EPO; JPO; 
DERWENT; 
IBM_TDB 


OR 


OFF 


2005/05/10 12:24 


S57 


1267650 


target address 


US-PGPUB; 
USPAT; 
EPO; JPO; 
DERWENT; 
IBM TDB 


OR 


OFF 


2005/05/10 12:24 

• 


S58 


669119 


host instruction 


US-PGPUB; 
USPAT; 
EPO; JPO; 
DERWENT; 
IBMJTDB 


OR 


OFF 


2005/05/10 12:24 


S59 


43391 


emulat$4 


US-PGPUB; 
USPAT; 
EPO; JPO; 
DERWENT; 
IBM TDB 


OR 


OFF 


2005/05/10 12:24 
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S60 


229138 


S53 and S54 


US-PGPUB; 
USPAT; 
EPO; JPO; 
DERWENT; 
IBM_TDB 


OR 


OFF 


2005/05/10 12:24 


S61 


57 


S60 and S55 


US-PGPUB; 
USPAT; 
EPO; JPO; 
DERWENT; 
IBM TDB 


OR 


OFF 


2005/05/10 12:24 


S62 


15 


S61 and S56 


US-PGPUB; 
USPAT; 
EPO; JPO; 
DERWENT; 
IBM TDB 


OR 


OFF 


2005/05/10 12:24 


S63 


15 


S62 and S57 


US-PGPUB; 
USPAT; 
EPO; JPO; 
DERWENT; 
IBM TDB 

X^^l 1 1 ^^^^ 


OR 


OFF 


2005/05/10 12:24 


S64 


12 


S63 and S58 


US-PGPUB; 
USPAT; 
EPO; JPO; 
DERWENT; 
IBM TDB 


OR 


OFF 


2005/05/10 12:24 


S65 


3 


S64 and S59 


US-PGPUB; 
USPAT; 
EPO; JPO; 
DERWENT; 
IBM TDB 


OR 


OFF 


2005/05/10 12:24 


S66 


529265 


software 


US-PGPUB; 
USPAT; 
EPO; JPO; 
DERWENT; 
IBM TDB 


OR 


OFF 


2005/05/10 12:24 


S67 


405367 


hardware 


US-PGPUB; 
USPAT; 
EPO; JPO; 
DERWENT; 
IBM TDB 


OR 


OFF 


2005/05/10 12:24 


S68 


273 


DLAT 


US-PGPUB; 
USPAT; 
EPO; JPO; 
DERWENT; 
IBM_TDB 


OR 


OFF 


2005/05/10 12:24 


S69 


229138 


S66 and S67 


US-PGPUB; 
USPAT; 
EPO; JPO; 
DERWENT; 
IBM_TDB 


OR 


OFF 


2005/05/10 12:24 
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S70 


4526 


TLB 


US-PGPUB; 
USPAT; 
EPO; JPO; 
DERWEm"; 
IBM TDB 


OR 


OFF 


2005/05/10 12:24 


S71 


57 


S69andS68 


US-PGPUB; 
USPAT; 
EPO; JPO; 
DERWENT; 
IBM_TDB 


OR 


OFF 


2005/05/10 12:24 


S72 


1267650 


target address 


US-PGPUB; 
USPAT; 
EPO; JPO; 
DERWENT; 
IBM TDB 


OR 


OFF 


2005/05/10 12:24 


S73 


15 


S71 and S70 


US-PGPUB; 
USPAT; 
EPO; JPO; 
DERWENT; 
IBM_TDB 


OR 


OFF 


2005/05/10 12:24 


S74 


669119 


host instruction 


US-PGPUB; 
USPAT; 
EPO; JPO; 
DERWENT; 
IBM TDB 


OR 


OFF 


2005/05/10 12:24 


S75 


15 


S73 and S72 


US-PGPUB; 
USPAT; 
EPO; JPO; 
DERWENT; 
IBM TDB 


OR 


OFF 


2005/05/10 12:24 


S76 


12 


S75 and S74 


US-PGPUB; 
USPAT; 
EPO; JPO; 
DERWENT; 
IBM TDB 


OR 


OFF 


2005/05/10 12:24 


S77 


40 


(((translation adj loolcaside adj 
buffer) or TLB) same (consisten$4 
or coheren$4) same (software or 
hardware) same instruction$2) 


US-PGPUB; 
USPAT; 
EPO; JPO; 
DERWENT; 
IBM TDB 


OR 


OFF 


2005/05/10 12:24 


S78 


0 


S76 and S77 


US-PGPUB; 
USPAT; 
EPO; JPO; 
DERWENT; 
IBM_TDB 


OR 


OFF 


2005/05/10 12:24 


S79 


43 


S49 or S50 or S51 


US-PGPUB; 
USPAT; 
EPO; JPO; 
DERWENT; 
IBM TDB 


OR 


OFF 


2005/05/10 12:24 
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S80 



S77 and S79 



US-PGPUB; 
USPAT; 
EPO; JPO; 
DERWENT; 
IBM TDB 



OR 



OFF 



2005/05/10 12:24 
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